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Novel Inducible Genes FROM Alfalfa 
AND Methods of Use Thereof 



The present invention relates to recombinant protein production in plants. 
More particularly, the present invention relates to novel inducible genes that are 
expressed upon harvest, methods for isolating such genes, and methods for using 
these genes or components therefrom. 

BACKGROUND OF THE INVENTION 

The mass production of recombinant molecules of commercial value is a 
technical area of increasing complexity and interest. Many different organisms have 
been considered as hosts for foreign protein expression including single*cell 
organisms such as bacteria and yeast, cell cultures of animals, fungi and plants, and 
whole organisms such as plants, insects, fiingi and transgenic animals. In general, 
each particular organism has unique characteristics that may offer advantages for 
production of specific proteins of interest. Alternatively the specificity of certain 
protein production platforms may limit utility for widespread applications. Thus, 
numerous molecular farming systems have been developed as a means to produce 
proteins of commercial interest. 

Of particular interest to the subject matter of the present invention is the 
expression of heterologous proteins in plant cells. Numerous foreign proteins have 
been expressed in whole plants and selected plant organs. Plants can offer a hi^y 
effective and economical means to produce recombinant proteins as they can be 
grown on a large scale with modest cost inputs and most conmiercially important 
species can now be transformed. 

In order to optimize protein production and recovery, a number of factors need 
to be considered. These include the levels of recombinant protein production, the 
temporal aspects of recombinant protein production, and the stability of the final 
product within the plant cell. The level of protein production must be sufficient to 
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allow accumulation of the product in quantities that are commercially valuable and 
can be conveniently isolated. In many instances, it may be desired that the temporal 
expression of the product coincide with the period when the crop is harvested or 
collected. In addition, it may be required that ttie protein stably accumulate to 
appreciable levels, or be induced to quickly accimiulate to appreciable levels if the 
product is intrinsically unstable. 

The production of heterologous proteins in plants has been achieved using a 
variety of approaches. US 6,650,307, US 5,716,802, US 5,763,748 disclose 
recombinant protein production using transcriptional fusions to a constitutive plant 
promoter. Production of heterologous proteins in seed (US 5,504,200; US 5,530,194; 
US 6,905,186; US 5,792,922; US 5,948,682), fruit (US 6,783,394; US 4,943,674) or 
storage organs such as tubers (US 5,436,393, US 5,723,757) have also been described. 

A disadvantage of constitutive expression systems is that constitutive 
expression of a protein may lead to toxic effects with regards to plant growth. 
Furthermore, it is difficult to predict what interactions a foreign protein may have 
with other plant proteins, such as enzymes or recqptors, plant membranes, such as 
those of the endoplasmic reticulum, Golgi apparatus, vacuole and plasmalemma, or 
the host of other molecules critical to the growth and development of the plant. 
Another potential disadvantage of a constitutive or non-inducible promoter is the 
metaboHc cost of synthesizing the transgenic protein in all tissues at all stages of 
growtii. If the only tissue to be harvested is the leaves, for example, it is inefficient 
and wasteful for the plant to produce the foreign protein in other tissues. 
Alternatively, if the transgene encoded protein is labile or imstable, then production of 
the protein, constitutively, throughout the growth of the plant is inefficient. 

Inducible systems allow the expression of an introduced gene to take place at a 
desired time in the development of a plant, under specific circumstances or in specific 
tissues. For example, leaf-specific promoters or promoters induced in the leaves by 
some treatment would restrict synthesis to only the harvested tissue. In addition, an 
induced foreign gene is potentially less likely to imdergo gene silencing than a 
transgene controlled by a constitutive or tissue specific promoter. Furthermore, 
inducible transg^e systems offer a method of biological containment since the 
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foreiga prdtein is not present in the crop until the application of the inducing 
treatment, at which time the crop is harvested. Containment of a protein produced 
from a foreign gene is as, or more, important than containment of the gene as the 
protein is the biologically active component. 

Gene expression in response to plant woxmding is another potential source of 
an inducible system. US 5,689,056, US 5,670,349, US 5,929,304, and US 5,777,200 
disclose the use of regulatory elements from wound inducible genes for the induction 
of heterologous protein synthesis in plants. However, the value of these wound- 
inducible promoters may be limited since wounding of the plant also induce other 
genes, such as proteases, that can negatively impact the production of the recombinant 
protein. It is also not clear that these regulatory elements provide sufficient levels of 
expression to cause accumulation of the recombinant protein to substantial levels, 
especially when the response is localized to the site of wounding (e.g. HMG2 
promoter, US 5,689,056). Although the expression levels with such promoters can be 
enhanced by applying more extensive wounding treatments or chemical inducers such 
as methyl jasmonate, this entails additional costs. 

Thus, although promoters involved in inducible systems can provide powerful 
tools for control of transgenes in plants, many obstacles are faced in utilizing these 
regulatory elements. Inducible promoter systems must enable the precise timing and 
location of expression of such transgenes in order to be commercially useful. In this 
regard, regulatory elements that can be induced under precise conditions amenable to 
cultivation practices are desired. More particularly, there is a need for regulatory 
elements that are induced, specifically, during harvesting conditions. ^, 

Volenec et al. ("Molecular analysis of alfalfsi root vegetative storage proteins" 
pp59-73 in Molecular and Cellular Technologies for Forage hnprovooient, CSSA 
Spec Publ. No. 26, 1998) have characterized the changes that ensue in root tissue 
following harvest and shoot regrowth of alfalfa {Medicago sativa L.). However, no 
specific regulatory elements were identified or characterized in any manner. 

Ferullo et al (Crop. Sci. 1996 36, 1011-1016) disclose proteins that are 
specific to harvesting conditions of alfalfa. However the structure or function of these 
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proteins was not characterized and is unknown; moreover, there is no indication of the 
nature of die genes expressed in harvested shoot tissue of alfalfa during harvesting. 
Furthermore, there is no suggestion as to the use of regulatory elements associated 
with these genes for induction of heterologous gene expression in plants in a harvest- 
inducible manner. 

Coupe et al. (WO 00/31251) disclose the characterization of a promoter from 
asparagine synthetase and its use in post harvest gene expression. 

It is an object of the invention to overcome disadvantages of the prior art. 

The above object is met by the combinations of features of the main claims, 
the sub-claims disclose further advantageous embodiments of the invention. 
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The present invention relates to recombinant protein production in plants. 
More particularly, the present invention relates to novel inducible genes that are 
expressed upon harvest, methods for isolating such genes, as well as methods for 
using these genes. 

The present invention provides a method (A) for isolating a harvest-inducible 
DNA sequence comprising: 

i) constructing one or more first cDNA libraries comprising cDNA sequences 
expressed in harvested tissue; 

ii) preparing one or more second cDNA libraries comprising cDNA sequences 
expressed in tissues of an intact plant prior to harvest; and 

iii) identifying harvest-inducible cDNA sequences. 

The expression of the harvest-inducible cDNA sequences may be analyzed to 
determine inducibility of the harvest-inducible cDNA sequences upon harvesting. 

An example of identifying harvest-induced cDNA sequences (step iii)) include 
subtractive hybridization of the first cDNA library with an excess of the second 
cDNA library, however, other methods may also be used as known in the art. 

The present invention also relates to an isolated harvest-inducible cDNA 
sequence obtained according to the above method (A). 

The present invention embraces an isolated harvest-inducible cDNA sequence 
selected from the group consisting of: 

i) SEQ ID NO:l, a complement thereof, a firagment of SEQ JD NO:l, a 
complement of a fragment of SEQ ID NO:l, a nucleic acid that hybridizes to SEQ ID 
NO:l under stringent hybridization conditions, a nucleic acid that hybridizes to a 
complement of SEQ ID NO:l under stringent hybridization conditions, a nucleic acid 
that hybridizes to a firagment of SEQ ID N0:1 under stringent hybridization 
conditions, or a nucleic acid that hybridizes to a complement of firagment of SEQ ID 
NO: 1 under stringent hybridization conditions; 
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ii) SEQ ID NO:2, a complement thereof, a fragment of SEQ ID NO:2, a 
complement of a fragment of SEQ ID NO:2, a nucleic acid tbat hybridizes to SEQ ID 
NO:2 under stringent hybridization conditions, a nucleic acid that hybridizes to a 
complement of SEQ ID NO:2 under stringent hybridization conditions, a nucleic acid 
that hybridizes to a fragment of SEQ ID NO:2 under stringent hybridization 
conditions, or a nucleic acid that hybridizes to a complement of fragment of SEQ ID 
NO:2 under stringent hybridization conditions; and 

iii) SEQ ID NO:3, a complement thereof, a fragment of SEQ ID NO:3, a 
complement of a fragment of SEQ ID NO:3, a nucleic acid that hybridizes to SEQ ID 
NO:3 under stringent hybridization conditions, a nucleic acid that hybridizes to a 
complement of SEQ ID NO:3 imder stringent hybridization conditions, a nucleic acid 
that hybridizes to a fragmrat of SEQ ID NQrS under stringent hybridization 
conditions, or a nucleic acid that hybridizes to a complement of fragment of SEQ ID 
NO:3 under stringCTit hybridization conditions, 

the stringent hybridization conditions comprising, hybridization overnight (12-24 hrs) 
at 42''C in the presence of 50% formamide, followed by washing, or 5X SSC at about 
eS^'C for about 12 to about 24 hours, followed by washing in O.IX SSC at 65*^0 for 
about one hour. 

Also provided in this invention is a method (B) for isolating a harvest 
inducible regulatory element comprising, 

i) identifying genomic DNA sequences 3' and 5* corresponding to the harvest- 
inducible cDNA identified using method (A); and . 

ii) analyzing the genomic DNA, arid identifying the harvest-inducible 
regulatory element. 

This method (B) may ftirther comprise a step of: 

iii) testing the harvest-inducible regulatory region within a transgenic plant or 
plant cell. 

The present invention also provides a harvest-inducible regulatory element . 
obtained using the method (B). 

The present invention also pertains to a harvest-inducible regulatory element 
selected from the group consisting of: 
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i) SEQ ID NO:4, a complement thereof, a fragment of SEQ ID NO:4, a 
complement of a fragment of SEQ ID NO:4, a nucleic acid that hybridizes to SEQ ID 
N0:4 mider stringent hybridization conditions, a nucleic acid that hybridizes to a 
complement of SEQ ID NO:4 imder stringent hybridization conditions, a nucleic acid 
that hybridizes to a fragment of SEQ ID NO:4 under stringent hybridization 
conditions, or a nucleic acid that hybridizes to a complement of fragment of SEQ ID 
N0:4 under stringent hybridization conditions; 

ii) SEQ ID NO:5, a complement thereof, a fragment of SEQ ID NO:5, a 
complement of a fragment of SEQ ID NO:5, a nucleic acid that hybridizes to SEQ ID 
NO:5 under stringent hybridization conditions, a nucleic acid that hybridizes to a 
complement of SEQ ID NO:S under stringent hybridization conditions, a nucleic acid 
that hybridizes to a fragment of SEQ ID NO:5 imder stringent hybridization 
conditions, or a nucleic acid that hybridize^ to a complement of fragment of SEQ ID 
NO: 5 under stringent hybridization conditions; and 

iii) SEQ ID NO:6, a complement thereof, a fragment of SEQ ID NO:6, a 
complement of a fragment of SEQ ID NO:6, a nucleic acid that hybridizes to SEQ ID 
NO:6 under stringent hybridization conditions, a nucleic acid that hybridizes to a 
complement of SEQ ID NO:6 under stringent hybridization conditions, a nucleic acid 
that hybridizes to a fragment of SEQ ID NO:6 xmder stringent hybridization 
conditions, or a nucleic acid that hybridizes to a complement of fragment of SEQ ID 
NO:6 under stringent hybridization conditions, 

the stringent hybridization conditions comprising, hybridization ovemi^t (12-24 hrs) 
at 42**C in the presence of 50% formamide, followed by washing, or 5X SSC at about 
65^C for about 12 to about 24 hours, followed by washing in O.IX SSC at 65°C for 
about one hom:, wherein the regulatory element exhibits harvest-inducible activity. 

Also provided in the preset inv«ition is a coiistract comprising the harvest- 
inducible regulatory element as just defined, operably linked with a heterologous 
nucleotide sequence of interest and a terminator region. The present invention also 
embraces a vector comprising the DNA constract as just defined. Fxulhennore, this 
invention pertains to a plant, plant tissue, plant seed, plant cell, or progeny therefrom, 
comprising the constract as just defined. 
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The present invention relates to a construct comprising a heterologous 
nucleotide sequence operably liiiked to said harvest-inducible regulatory element 
defined above, where the harvest-inducible regulatory element further comprises a 
nucleotide sequence encoding a harvest-inducible protein or firagment thereof. The 
present invention also embraces a vector comprising the DNA construct as just 
defined. Furthermore, this invention pertains to a plant, plant tissue, plant seed, plant 
cell, or progeny therefi-om, comprising the construct as just defined 

The present invention also provides a method (C) for production of a 
heterologous protein into a plant comprising: 

i) introducing a constmct comprising a harvest-inducible regulatory element 
operably hnked with a heterologous nucleotide sequence of interest and a terminator 
region, to the plant to obtain a transformed plant, where the harvest-inducible 
regulatory element is selected firom the group consisting of: 

SEQ ID NO:4, or a fiagment thereof; 
SEQ ID NO:5, or a fragment thereof; 
SEQ ID NO:6, of a Augment thereof; 

a nucleic acid that hybridizes to SEQ ID NO:4, 5, 6, or a complement 
of SEQ ID NO:4, 5, 6 under stringent hybridization conditions; and 

a nucleic acid that hybridizes to a firagment of SEQ ID NO:4, 5, 6, or a 
complement of SEQ ID NO:4, 5, 6 under stringent hybridization conditions, 
the stringent hybridization conditions comprising, hybridization overnight (12-24 hrs) 
at 42*'C in the presence of 50% formamide, followed by washing, or 5X SSC at about 
65*^0 for about 12 to about 24'hours, followed by washing in O.IX SSC at 65^C for 
about one hour; 

ii) growing the transformed plant; and 

iii) harvesting the transformed plant thereby inducing expression of the 
heterologous protein. 

The step of harvesting (step iii) may be followed by: 

iv) isolating the heterologous protein firom the transformed plant. 
Furthermore, the step of isolating (step iv)) may be followed by a step of purification 
of the heterologous protein. 
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The present invention also pertains to a method (D) for production of a 
heterologous protein comprising, 

i) providing a plant transformed with a construct comprising a harvest- 
inducible regulatory element operably linked with a heterologous nucleotide sequence 
of interest and a terminator region, where the harvest-inducible regulatory element is 
selected from the group consisting of 

SEQ ID NO:4, or a fragment thereof; 
SEQ ID NO: 5, or a fragment thereof; 
SEQ ID NO:6, of a fragment thereof; 

a nucleic acid that hybridizes to SEQ ID NO:4, 5, 6, or a complement 
of SEQ ID NO:4, 5, 6 under stringent hybridization conditions; and 

a nucleic acid that hybridizes to a fragment of SEQ ED NO:4, 5, 6, or a 
complement of SEQ ID NO:4, 5, 6 under stringent hybridization conditions, 
the stringent hybridization conditions comprising, hybridization overnight (12-24 hrs) 
at 42**C in the presence of 50% formamide, followed by washing, or 5X SSC at about 
65°C for about 12 to about 24 hours, followed by washing in O.IX SSC at SS^'C for 
about one hour, and the harvest-inducible regulatory element fiuther comprises a 
nucleotide sequence encoding a harvest-inducible protein or fragment thereof; 

ii) growing the transformed plant; and 

iii) harvesting the transformed, plant to induce expression of the heterologous 
protein. 

The step of harvesting (step iii) may be followed by: 

iv) isolating the heterologous protein from the transformed plant. 
Furfheraiore, the step of isolating (step iv)) may be followed by a step of purification 
of the heterologous protein. 

The harvest-inducible regulatory elements can be used to control the 
expression of a heterologous DNA sequence, such that tiie heterologous DNA 
sequence is only expressed in response to harvesting, thus providing a convenient 
system for the production of novel proteins. Accordingly, another aspect of the 
present invention is directed to DNA constructs comprising a harvest-inducible 
regulatory element operably linked with a heterologous nucleotide sequence of 
interest and a terminator region. 
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In order to enhance translation, stability or recovery of the het^logous or 
foreign protein, the nucleotide sequence encoding the heterologous protein can be 
operably linked to a harvest-inducible gene encoding a portion of a harvest-inducible 
protein and its corresponding harvest-inducible regulatory elemrat. Accordingly, 
another aspect of the present invention relates to a DNA construct comprising a 
heterologous nucleotide sequence encoding a heterologous protein of interest 
operably linked to an isolated harvest inducible regulatory element and a portion of 
the harvest-inducible gene encoding a harvest-inducible protein or fragment thereof. 

The DNA constructs may be ligated or incorporated into an appropriate vector 
and used to transform plants in order to express heterologous proteins in plants. 
Accordingly, another aspect of the invention is directed to a plant, plant tissue, plant 
seed, or plant cell comprising a harvest-inducible regulatory element operably linked 
with a hetorologous nucleotide sequence and a terminator region. 

in yet another aspect of the invention, transgenic plants are produced, the 
plants comprising a harvest-inducible transgene, the transgene comprising a harvest- 
inducible regulatory element operably linked to a heterologous nucleotide sequence 
and a terminator region. The transgene may encode a protein of veterinary or 
pharmaceutical or biological activity, where the activity is useful for administration to 
livestock by feeding of whole or parts of harvested plant. 

This summary of the invention does not necessarily describe all necessary 
features of the invention but that the invention may also reside in a sub-combination 
of the described features. 
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BIUEF DESCRIPTION OF THE DRAWINGS 



These and other features of the invention will become more apparent from the 
following description in which reference is made to the appended drawings wherein: 

FIGURE 1 shows a schematic diagram of PCR-Select cDNA substraction library 
method for isolating harvest-inducible cDNA clones (see Example 1 for more 
detail of the method). cDNA containing harvest-specific transcripts is referred 
to as the '^tester*' cDNA and the cDNA from the non-harvested plants, referred 
to as "driver." cDNA Type "e" molecules are formed only if the sequence is 
up-regulated in the tester cDNA. Solid lines represent the Rsa I digested tester 
or driver cDNA. Solid boxes rq)resent the outer part of the Ad^tor 1 and 2R 
longer strands and corresponding PCR primer 1 sequence. Clear boxes 
represent the inn^ part of Adaptor 1 and the corresponding nested PCR primer 
1 sequence; shaded boxes represent the inner part of Adaptor 2R and the 
corresponding nested PCR primer 2R sequence 

FIGURE 2 shows sequences of PCR-Select cDNA synthesis primer (SEQ ID 
NO:12), adaptors 1 and 2R (SEQ ED Nos: 13-14), PCR primer 1 (SEQ ID NO: 
15) and nested PCR primers 1 and 2R (SEQ ID Nos: 16-17). When the 
adaptors are ligated to iZi^ai-digested cDNA, the Rsalsite is restored. 

FIGURE 3 shows Northern blot analysis of the expression of cDNA H7 following 
harvest of leaf tissue. RNA was isolated from alfalfa leaves and probed with 
H7 (SEQ ID NO:l). Leaves obtained before harvest (la^es 1, 5), 45 min post 
harvest (lanes 2, 6), 6 hours post harvest (lanes 3, 7) and 24 hours post harvest 
(lanes 4, 8). H7 RNA is not detected in alfalfa leaves in non-harvest, i.e. pre- 
harvest conditions nor following wounding or heat treatments (data not 
shown). 

FIGURE 4 shows Northem blot analysis of cDNA H11(SEQ ID NO:2) imder 
harvesting and heat shock conditions of treatment of alfalfa leaves. RNA 
extracted from leaves of: lane 1, non-treated plants; lanes 2-5, plants in 
harvested conditions for 30 min, 2 hours, 6 hours, 24 hours; lanes 6 and 7, 
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plants subjected to 15 or 30 min of heat shock at 38^C. RNA was probed with 
HI 1 cDNA clone (SEQ ID N0:2). Arrow indicates major transcript of HI 1. 

FIGURE 5 shows Northern blot analysis of cDNA HI 1 (SEQ ID NO:2) following 
wounding of alfalfa leaves. RNA extracted from: lane 1, non-wounded plants; 
lanes 2-4, plants were wounded using a scalpel and RNA extracted after 45 
min, 6, hours, and 24 hours post wounding. RNA was probed with Hll 
cDNA clone.(SEQ ID NO:2) 

FIGURE 6 shows a Northern blot analysis of cDNA clone H12 (SEQ ID NO:3) 
following harvesting and heat shock treatments of alfalfa leaves. RNA 
extracted from leaves of: lane I, non-treated plants; lanes 2-5, plants in 
harvested conditions for 30 min, 2 hours, 6 hours and 24 hours; lanes 6 and 7, 
plants subjected to 15 or 30 min of heat shock at 38**C. 

FIGURE 7 shows a diagram of a vector constract containing the putative promoter 
region of an HI gene and a GFP reporter gene. The arrows represent the left 
and right borders of the T-DNA region of a binary vector used in 
Agrobactrium-mQdiatcd gene transfer. P represents a promoter used to drive a 
selective marker such as the resistance gene to the antibiotic neomycin, and T 
represents a terminator regulatory element such as that derived from nopaline 
synthase, the CaMV 35S gene or from a plant gene such as H7. The 
GFP(EGF) coding region -following the H7 promote: could represent the 
coding region of the fluorescent protein, a fiision protein consisting of GFP 
and EGF (epidermal growth fector), or simply the coding region of EOF alone 
or any other sequence encoding a peptide or protein with medical or veterinary 
properties. 

FIGURE 8 shows the H7 genomic sequence (SEQ ID NO:7), including the 5' 
flanking regulatory, and coding, regions. The regulatory region is from 
nucleotide 1 to nucleotide 634 (SEQ ID NO:4) which is the transcription 
initiation site (bold, large A), Putative TATA boxes are enclosed in a boxed 
outline. The coding region of the H7, gene is in bold italics and begins at 
nucleotide 675 and ends at nucleotide 1148 (SEQ ID NO:l); the single letter 
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amino acid sequence of tbe protein is under the DNA sequence. The 3' UTR 
starts at 1 149 up to the poly A sequence. 

FIGURE 9 shows the HI 1 genomic sequence (SEQ ID NO:8) and associated regions. . 
Regulatory region (SEQ JD NO:5, nucleotides 1 to about 438), intron 
(nucleotides 651-772) and 3' UTR (nucleotides 1239 to the polyA sequence) 
are in lower case; coding region (nucleotides 439-650 and 773-1238; SEQ ID 
NO:2) is in upper case and bold. 

FIGURE 10 shows the H12 genomic sequrace (SEQ ID NO:9) and associated 
regions. Regulatory region (nucleotides 1-936; SEQ ID NO:6), 3' UTR 
(nucleotides 1720-1906); coding region (nucleotides 976-1720; SEQ ID NO:3) 
is in upper case and bold. 

FIGURE 11 shows binary vectors containing harvest-inducible promoters fused to 
the GUS gene. Pro: promoter; T: terminator; RB/LB: right/left borders from T- 
DNA region of Ti plasmid of Agrobacterium; 358: from the regulatory region 
of the 35S transcript of the cauUflower mosaic virus; ^: catalase intron in GUS 
gene. 

FIGURE 12 shows the expression of H7-GUS in tobacco at time zero (left) and 24 
hours post-harvest (rigjit). 

FIGURE 13 shows GUS expression in random samples from M truncatula plants 
grown from seedling co-cultivated with Agrobacterium. Upper left plate: 
H12-GUS (24 hoxirs post-harvest); upper right plate: HI 1-GUS (24 hours post- 
harvest); lower left: H7-GUS (24 hours post-harvest); lower middle: 35S-GUS 
(24 hours post-harvest); lower right: untransformed control. The chimeric 
nature of transformation events results in non-blue sectors of plants, and hence 
leaves and stems showing no blue coloration. 
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DESGRBPTION OF PREFERRED EMBODIMENT 



The present invention relates to recombinant protein production in plants. 
More particularly, the present invention relates to novel inducible genes that are 
expressed upon harvest, and methods to use these genes . 

The following description is of a preferred embodiment by way of example 
only and without limitation to the combination of features necessary for carr)ang the 
invention into effect 

The singular forms "a," "an" and "the" include plural reference unless the 
context clearly dictates otherwise. 

Two DNA sequences are "operably linked" if the nature of the linkage does 
not interfere with the ability of the sequences to effect their normal functions relative 
to each other. For instance, a promoter, or a regulatory region would be operably 
linked to a coding sequence if the promoter or regulatory region were enable of 
effecting transcription of that coding sequence. 

By "regulatory region" or "regulatory element" it is meant a nucleic acid 
sequence that has the property of controlling the expression of a DNA sequence that is 
operably linked with the regulatory region. Such regulatory regions may include 
promoter or enhancer regions, and other regulatory elements recognized by one of 
skill in the art. By '^promoter** it is meant the nucleotide sequences at the 5' end of a 
coding region, or fragment thereof that contain all the signals essential for the 
initiation of transcription and for the regulation of the rate of transcription. 

The term "gene" is used in accordance with its usual definition in flie art to 
mean an operatively linked group of nucleic acid sequences. By operatively linked it 
is meant that the particular sequences interact either directly or indirectly to carry out 
their intended function, such as mediation or modulation of gene expression. The 
interaction of operatively linked sequences may for example be mediated by proteins 
that in turn interact with the sequences. A transcriptional regulatory region and a 
sequence of interest are operably linked when the sequences are functionally 
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coimected so as* to permit transcription of the sequence of interest to be mediated or 
modulated by the transcriptional regulatory region. 

By "coding sequence of interest" it is meant any coding sequence that is to be 
expressed in a transformed plant. Such a coding sequence of interest may include, but 
is not limited to, a coding sequence that encodes an antigen, such as a viral coat 
protein or microbial cell wall or toxin proteins or various other antigenic peptides, 
such as swine viral antigen. Other proteins or peptides of interest include growth 
factors, such as epidermal growth factor, antimicrobial peptides, such as defensins, 
and other peptides with physiological and immimological properties, such as opioids 
and cytokines, or other pharmaceutically active proteins. Such proteins include, but 
are not limited to, interleuldns, insulin, G-CSF, GM-CSF, hPG-CSF, M-CSF or 
combinations thereof, interferons, for example, interferon-Doc, intCTferon-6, 
interferon-T, blood clotting factors, for example. Factor VDI, Factor IX, or tPA or 
combinations thereof. Furthermore, a coding sequence of interest may also encode an 
industrial enzyme, protein supplement, nutraceutical, or a value-added product for 
feed, food, or both feed and food use. Examples of such proteins include, but are not 
limited to proteases, oxidases, phytases, chitinases, invertases, lipases, cellulases, 
xylanases, enzymes involved in oil biosynthesis etc. Other protein supplements, 
nutraceuticals, or a value-added products include native or modified seed storage 
proteins and the like. The invention is not limited by the source or the use of the 
recombinant polypeptide or heterologoiis nucleotide sequence encoding the 
polypeptide. 

A "transgenic" organism, such as a transgenic plant, is an organism into which 
foreign DNA has been introduced. A "transgenic plant" encompasses all descendants, 
hybrids, and crosses thereof, whether reproduced sexually or asexually, and which 
continue to harbour the foreign DNA. 

A "vector" may be any of a number of nucleic acid sequences into which a 
desired sequence may be inserted by restriction and ligation. A vector typically carries 
its own origin of rqplioation, one or more unique recognition sites for restriction 
endonucleases which can be used for the insertion of foreign DNA, and. usually 
selectable markers such as genes coding for antibiotic resistance or herbicide 
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resistance, and often recognition sequences (e.g. promoter) for the expression of the. 
inserted DNA. Conunon vectors include, but are not limited to, viral vectors, 
plasmids, phage, phagmids, and cosmids. Vectors may also be modified to contain a 
region of homology to an Agrobacterium twnefaciens vector, preferably a T-DNA 
border region from Agrobacterium timefaciens. Fmther, vectors can comprise a 
disarmed plant tumor inducing plasmid of Agrobacterium tumefaciens. 

Unless defined otherwise all technical and scientific terms used herein have 
the same meaning as commonly understood to one of ordinary skill in the art to which 
this invention belongs. 

A nucleotide sequence is said to exhibit •*harvest-inducible regulatory activity" 
when the nucleotide sequence (the first nucleotide sequence, or harvest inducible 
regulatory element) regulates expression of a second nucleotide sequence to which it 
is operably linked, following harvesting of plant tissue. A regulatory region (the first 
nucleotide sequence) that exhibits harvest-inducible regulatory activity (a harvest- 
induciblc regulatory element) may also exhibit activity under other conditions for 
example but not limited to, wounding, heat shock, or other environmental stresses. 
Harvest-inducible regulatory activity may result in an increase in the expression of the 
second nucleotide sequence, or a decrease in the expression of the second nucleotide 
sequence, when compared to the expression of the second nucleotide sequence imder 
non-harvest conditions. A harvest-inducible regulatory element may therefore be 
active in increasing or decreasing expression of a second nucleotide sequence to 
which it is operably linked, relative to the expression of the second nucleotide 
sequence under non-harvest conditions. 

The present invention provides regulatory elements obtained firom genes that 
exhibit modified expression ujpon harvest of plant tissue. Furthermore, tiie present 
invention plains to the use of these regulatory regions for the expression of 
heterologous proteins in plants. The present invention is also directed to chimeric 
constructs containing a DNA of interest operatively linked to a harvest-inducible 
regulatory element of the present invention. Any exogenous gene, or gene of interest 
comprising a coding sequence of interest, can be used and manipulated according to 
the present invention to result in the expression of the exogenous gene. 
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Harvesting, as is typically carried out in the field involves cutting of plants at 
the base of the stem at a desired stage of growth, for example but not limited to the 
late bud stage, and laying cut material in a swath followed by drying at ambient field 
moisture and temperature conditions to a specific moisture level appropriate for 
baling or ensiling. 

The present invention provides a method to isolate harvest-inducible genes 
comprising: 

i) constmcting a cDNA subtraction library using any suitable method 
known in the art, from harvested and non-harvested tissues and identifying clones 
unique to the harvested tissues; and 

ii) identifying sequences preferentially expressed in response to 

harvesting. 

These harvest- inducible cDNA sequences may be characterized using 
Northern analysis and sequencing. 

Examples of harvest-induced cDNA sequences that are preferentially 
expressed in response to harvesting conditions, and generally not expressed under 
other conditions typical of cultivation, include, but are not limited to H7, HI 1 and 
H12 (SEQ ED NO*s: 1-3, respectively), firagments thereof, sequences that hybridize to 
SEQ ID NO's:l-3, firagments thereof under stringent hybridization conditions as 
known in the art, and compl^ents of these sequences, or sequeiices that exhibit a 
80% - 100% similarity using sequ^ice alignment protocols, for example, but not 
limited to, BLAST. The coding region of H7 (SEQ ID NO:l) comprises nucleotides 
675-1148 of Figure 8. The coding region of HU (SEQ ED NO:2) comprises 
nucleotide about 439-650 and nucleotides 773-1238 of Figure 9. The coding region of 
H12 (SEQ ID NO:3) comprises nucleotides about 976-1720 of Figure 10. 

Stringent hybridization conditions are known within the art (e.g. Sambrook et 
al, 1989, in ^Molecular cloning: a laboratory manual", 2^^ edition, Cold Spring 
Harbor, N.Y.: Cold Spring Harbor Laboratory, which is incorporated herein by 
reference), and may comprise, hybridization overnight (12-24 hrs) at 42*'C in the 
presence of 50% formamide, followed by washing using standard protocols 
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(Sambrook et al, 1989), or 5X SSC at about 65*^0 for about 12 to about 24 hours, 
followed by washing in 0.1 X SSC at 65®C for about one hour. 

Sequence comparisons between two or more polynucleotides (or polypeptides, 
as required) may be performed by comparing portions of the two sequ^ices over a 
comparison window to identify and compare local regions of sequence similarity^ 
The percentage similarity is calculated by: (a) determining the nimiber of positions at 
which the identical nucleic acid base or amino acid residue occurs in both sequences 
to yield the mmaber of matched positions; (b) dividing the number of matched 
positions by the total number of positions in the window of comparison; and, (c) 
multiplying the result by 100 to yield the percentage of sequ^ce identity.. Optimal 
alignment of sequraces for comparison may be conducted by utilizing readily 
available sequence comparison and multiple sequence alignment algorithms are, 
respectively, the Basic Local Alignment Search Tool (BLAST) (Altschul, S.F. et al. 
1990. J. MoL Biol. 215:403; Altschul, S.F. et al. 1997. Nucleic Acids Res. 25: 3389- 
3402) and ClustalW programs. BLAST is available on the Internet at 
http://www.ncbi.nlm.nih.gov and a version of ClustalW is available at 
http://www2.ebi.ac.uk using default parameters (for example but not Umited to. 
Program: blastn; Database:nr; low complexity; Expect 10; Word size 11). 

Using the above method, harvest-inducible cDNA's may be identified and 
characterized. For example, which is not to be considered limiting in any manner, 
expression of H7 or HI 2 is not detected in pre-harvested plant material yet expression 
increases significantly after tissue is harvested (see Figures 3 and 6, respectively). 
Similarly, Hll expression increases significantly after harvesting (see Figure 4). 
However, increase in expression of Hll is also observed in response to heat shock 
and wounding (Figures 4, 5). 

Genome walking may be used to identify regulatory regions associated with a 
harvest-inducible cDNA (see Exainple 3). Altanatively, harvest-induced cDNA 
sequences may be used to isolate regulatory elements associated with one or more 
genomic sequences that are similar to harvest induced cDNA sequences, or that 
hybridize to harvest induced cDNA sequences under specified hybridization 
conditions. Regulatory elements thus obtained are capable of conferring harvest- 
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inducibility upon one or more coding sequences of interest that are operably linked to 
the regulatory elements. 

Therefore, the present invention, also relates to the isolation of regulatory 
elements comprising, 

i) isolating genomic DNA from a plant; and 

ii) identifying a regulatory region within the genomic DNA using 
harvest- induced cDNA, 

The identified regulatory region may then be further characterized by 
sequencing and expression analysis, for example, the regulatory region maybe used to 
drive expression of a marker sequence and the activity of the regulatory region 
analyzed in various tissues and under different environmental or harvest conditions, v. 
The regulatory region maybe identified using genomic walking using PCR primers 
identified from harvest-inducible cDNA's. However, other mefliods that are known 
in the art may also be used. 

A regulatory element identified using the above method may be operably 
linked with a coding sequence of interest, for example a marker gene, see for 
example, but not limited to the construct of Figure 7, and tested to demonstrate 
harvest inducibiUty using any suitable technique, for example but not limited to 
biolistics, protoplast, or Agrobacterium transformation, as disclosed herein. 

Using the above methods, one or more regulatory regions may be identified 
that are capable of conferring harvest-iniducibility upon a coding sequence of interst 
operably linked to fhe regulatory region. Examples, which are not to be considered 
limiting in any manner, of regulatory elements obtained using the methods of the 
present invention include SEQ ID NO's: 4-6 (regulatory regions of H7, nucleotides 1- 
634 of Figure 8; HI 1, nucleotides 1 to about 438 of Figure 9; and H12, nucleotides 1- . 
935 of Figure 10, respectively), firagments thereof, or sequences that hybridize to SEQ 
ID NO's:4-6, or their complement, imder stringent hybridization conditions (e.g. 
hybridization overnight (12-24 hrs) at 42°C in the presence of 50% formamide, 
foUowed by washing using standard conditions, or 5X SSC at about 65 °C for about 12 
to about 24 hours, followed by washing in O.IX SSC at 65°C for about one horn*) or 
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that exhibit a 80% - 100% similarity using sequence alignment protocols, for 
example, but not limited to, BLAST (Program: blastn; Database:nr; low complexity; 
Expect 10; Word size 11), provided the sequence exhibits harvest-inducible 
regulatory element activity. 

The present invention therefore provides DNA constructs useful for producing 
a protein or peptide of interest within a plant. Examples of DNA constructs of the 
present invention, which are not to be considered limiting in any manner, include a 
coding sequence of interest operably linked to a harvest inducible regulatory element, 
or a nucleotide sequence encoding the protein of interest fused to a nucleotide 
sequence encoding a harvest-induced protein, or a portion thereof, where the 
nucleotide sequence encoding the harvest-induced protein or portion thereof is 
operably linked to a harvest-inducible regulatory element. This latter construct may 
be used to ensure stability of a protein of interest following expression in a plant. It is 
also contCTiplated that peptide sequences that facilitate isolation, purification, or both 
of the protein of interest, for example affinity tags, protease cleavage sites, or both 
may be included in the DNA constructs. These DNA constructs may be introduced 
into an expression cassette suitable for plant transformation. 

The present invention is also directed to a method for production of a protein 
or peptide of interest comprising, 

i) introducing a constmct comprising a coding sequence of interest 
operably linked to a harvest inducible regulatory element into a plant, to obtain a 
transgenic plant; 

ii) growing the transgenic plant; and 

iii) harvesting the transgenic plant thereby inducing production of tiie 
protein of interest 

If required, the protein or peptide of interest may be recovered after harvest. 

Additionally, the present invention provides a method for production of a 
protein or peptide of interest comprising, 

i) providing a plant comprising a construct comprising a coding 
sequence of interest operably linked to a harvest inducible regulatory element; 
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ii) growing the plant; and 

iii) harvesting the plant thereby inducing production of the protein or 
peptide of interest. 

If required, the protein or peptide of interest may be recovered after harvest. 

The HI promoters of the present invention are similarly regulated across plant 
families and genera, such that they have applications in crops of various species. 
Thus, this method may be used witii any desired plant, for example but not limited to 
potato, tomato, canola, corn, soybean, alfalfa, pea, lentil, other forage legumes such as 
clover, trefoil, forage grasses such as timothy, ryegrass, brome grass, fescue or other 
cereal grasses used for forage such as barley, wheat, sudan grass, sorgham. 

The present invention also provides a method for enhancing translation, 
stability, recovery, or a combination thereof, of a protein or peptide of interest upon 
harvest of a plant tissue comprising: 

i) introducing a construct comprising a coding sequence of interest 
fused to a nucleotide sequence encoding a harvest-induced protein, or a portion, 
fliereof into a plant to obtain a transgenic plant, where the nucleotide sequence 
encoding the harvest-induced protein or portion thereof is operably linked to a 
harvest-inducible regulatory element; 

ii) growing the transgenic plant; and 

iii) harvesting of the transgenic plant to induce expression of the 
protein or peptide of interest. 

If required, the protein or peptide of interest may be recovered after harvest 

Furthermore, a method for enhancing translation, stability, recovery, or a 
combination thereof, of a protein or peptide of interest upon harvest of a plant tissue is 
also provided, the method comprising: 

i) providing a plant comprising a construct, the construct comprising a 
coding sequence of interest ftised to a nucleotide sequence encoding a harvest- 
induced protein, or a portion thereof, where the nucleotide sequence encoding the 
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harvest-induced protein or portion thereof is operably linked to a harvest-inducible 
regulatory element; 

ii) growing the plant; and 

iii) harvesting the plant to induce expression of the protein or peptide 

of interest. 

If required, the protein or peptide of interest may be recovered after harvest. . 

As the HI promoters of the present invention are similarly regulated across 
plant families and genera, this method may be used with any desired plant, for 
example but not limited to potato, tomato, canola, com, soybean, alfalfa, pea, lentil, 
other forage legumes such as clover, trefoil, forage grasses such as timothy, ryegrass, 
brome grass, fescue or other cereal grasses used for forage such as barley, wheat, 
Sudan grass, sorgham. 

With either of the above methods, the protein of interest may be isolated and 
purified, as required, using standard techniques known in the art. 

The methods provided herein may be used to produce heterologous proteins of 
interest in a plant, and allows for the production of crop plants specifically designed 
for molecular farming wherein plants produce novel proteins with commercial or 
pharmaceutical applications. 

Of particular interest are those proteins or peptides that may have a ther^eutic 
value, for example vaccines. Vaccines produced by the methods of the present 
invention include antigens, such as viral coat proteins or microbial cell wall or toxin 
proteins or various other antigenic peptides, such as swine viral antigen. Other 
proteins or peptides of interest include growth factors, such as epidermal growth 
factor, antimicrobial peptides, such as defensins, and other peptides with 
physiological and immunological properties, such as opioids and cytokines. The 
invention is not limited by the source or the use of the recombinant polypeptide or 
heterologous nucleotide sequence encoding the polypeptide. 
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Examples of other proteins which may be produced in plants or crops by using 
the regulatory elements, constructs, or methods of the present invention, and that may 
be considered as genes of interest, include but are not limited to, industrial enzymes, 
for example, proteases, carbohydrate modifying enzymes such as alpha amylase, 
glucose oxidase, cellulases, hemicellulases, xylanases, mannases or pectinases, (for 
example US 5,824,870, US 5,767,379, US 5,804,694). Additionally, the production 
of enzymes particularly valuable in the pulp and paper industry such as ligninases or 
xylanases is also contemplated (for example US 5,981,835). Other examples of 
enzymes include phosphatases, oxidoreductases and phytases (for example US 
5,714,474). The number of industrially valuable raizymes is large and plants can offer 
a convenient vehicle for the mass production of these proteins at costs anticipated to 
be competitive with fermentation, provided the production system is efficient and 
easily manipulated. Also contemplated! are protein-based elastomers to replace 
allergenic compHDimds such as latex. 

Additionally, molecular farming is also being contemplated for use in the 
production and delivery of vaccines (for example, US 6,136,320, US, 5,914,123, US 
5,679,880, US 5,679,880, US 5,654,184, US 5,612,487, US 6,034,298, WO 
99/37784A1), antibodies (for example, WO 97/2900A1, US 5,959,177, US 5,202,422, 
tFS 5,639,947, US 6,046,037), peptide homiones (for example, US 5,487,991, WO 
99/67401 A2), blood factors and similar therapeutic molecules. It has been postulated 
that edible plants which have been engineered to produce selected therapeutic agents 
could provide a means for drug delivery which is cost effective and particularly suited 
for the administration of therapeutic agents m rural or under developed countries. The 
plant material containing the therapeutic agents could be cultivated and incorporated 
into the diet (for example US 5,484,719). Similarly, plants used for animal feed can 
be engineered to express veterinary biologies that can provide protection against 
animal disease, (for example WO 99/37784A1). 

The DNA sequence encoding the protein of interest may be synthetic, 
naturally derived, or a combination thereof. Dependent upon the nature or source of 
the DNA encoding the polypeptide of interest, it may be desirable to synthesize the 
DNA sequence with codons that represent plant-preferred codons. It is contemplated 
that the coding region of the protein of interest can be joined to the coding sequence 
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of a harvest'inducible protein obtained as described herein, to aid in stability or 
accumulation, or to provide a convenient means to isolate the protein. 

The chimeric DNA constructs of the present invention can further comprise a 
termination (or 3* untranslated) region. A termination region refers to that portion of a 
gene comprising a DNA segment that contains a polyadenylation signal and any other 
regulatory signals capable of effecting mRNA processing or gene expression. The 
polyadenylation signal is usually characterized by effecting the addition of 
poly adenylic acid tracks to the 3* end of die mRNA precursor. Polyadenylation 
signals are commonly recognized by the presence of homology to the canonical form 
5-AATAAA-3 although variations are not uncommon. 

Examples of suitable termination regions are the 3' transcribed non-translated 
regions containing a polyadenylation signal of Agrobacterium tumour inducing (Ti) 
plasmid genes, such as the nopaline synthase (Nos gene) and plant genes such as the 
soybean storage protein genes and the small subunit of die ribulose-1, 5-bisphosphate 
carboxylase (ssRUBISCO) gene. 

The termination region operably linked to the heterologous gene will be 
priniarily one of convenience, since in many cases termination regions appear to be 
relatively interchangeable. 

The DNA constructs of the present invention can also include further 
enhancers, either translation or transcription enhancers, as may be required. These 
enhancer regions are well known to persons skilled in the art, and can include the 
ATG initiation codon and adjacent sequences. The initiation codon must be in phase 
with the reading finme of the coding sequence to ensure translation of the entire 
sequence. The translation control signals and initiation codons can be firom a variety 
of origins, both natural and synthetic. Translational initiation regions may be 
provided from the source of the transcriptional initiation region, or from the stmctural 
gene. The sequence can also be derived from the promoter selected to express the 
gene, and can be specifically modified so as to increase translation of the mRNA. 
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The DNA constructs of the present invention can further comprise signal 
peptides operably linked to a gene of interest such that expression is targeted to a 
specific organelle. 

A variety of techniques are available for the introduction of DNA into host 
cells. For example, the chimeric DNA constructs may be introduced into host cells 
using standard Agrobacterium vectors by transformation protocols (EP 131320 Bl, 
US 5,591,616, US 5,149,645, US 4,693,976; all of which are incorporated herein by 
refemece). The use of T-DNA for transformation of plant cells has received extensive 
study and is amply described in EP 120516 (also see Hoek^ia et al., 1985, Chapter 
V, In: The Binary Plant Vector System OfTset-drukkerij Kanters B.V., Alblasserdam; 
Knauj^ et al., 1983, Genetic Analysis of Host Range Expression by Agrobacterimn, p. 
245, In: Molecular Genetics of the Bacteria-Plant Interaction, Puhler, A. ed., 
Springer-Verlag, NY; and An et. al., 1985, EMBO J., 4:277-284, which are 
incorporated herein by reference). 

The use of non-Agrobacterium techniques permits the use of the constructs 
described herein to obtain transformation and expression in a wide variety of 
monocotyledonous and dicotyledonous plants and other organisms. These techniques 
include biolistics (US 5,865,796, US 5,120,657, US 5,371,015, US 5,179,022; which 
are incorporated herein by reference), electroporation (US 5,859,327, US 6,002,070; 
Fromm et al., 1985, Proc. Natl. Acad. Sci. USA, 82:5824-5828; Riggs, and Bates, 
1986, Proc. Natl. Acad. Sci. USA 83:5602-5606; which are incorporated herein by 
refCTence), microinjection of protoplasts, (US 4,743,548, which is incorporated herein 
by reference), penetration of cells with timgsten whiskers, (US 5,302,523, US 
5,464,765, which are incorporated herein by reference), lasers, (US 5,013,660, which 
is incorporated herein by reference), sonification, (US 5,693,512, which is 
incorporated herein by reference) or PEG-mediated DNA uptake (Potrykus et al., 
1985, Mol. Gen. Genet, 199:169-177; US 5,453,367, which are incorporated herein 
by reference). 

The expression cassette may be joined to a marker for selection in plant cells. 
Conveniently, the marker may be resistance to a herbicide, eg phosphintfaricin or 
glyphosate, (US 5,4553,367, US 4,940,835, US 5,648,477) or an antibiotic, siich as 
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kanamycin, 6418, bleomycin, hygromycin, chloramphenicol, (for example US 
5,1 16,750, US 6,048,730) or the like. Similarly, enzymes providing for production of 
a compound identifiable by colour change such as GUS (pD -glucuronidase), or 
limiinescence, such as luciferase or GFP are also useful. 

Also considered part of this invention are transgenic plants containing the 
gene construct of the present invention. Methods of regenerating whole plants from 
plant cells are known in the art, and the method of obtaining transformed and 
regenerated plants is not critical to this invention. In general, transformed plant cells, 
are cultured in an appropriate medium, which may contain selective agents such as 
antibiotics, where selectable markers are used to facilitate identification of 
transformed plant cells. Once callus forms, shoot formation can be encouraged by 
employing the £q[>propriate plant hormones in accordance with known methods and the 
shoots transferred to rooting medium for regeneration of plants. * The plants may thm 
be used to establish repetitive generations, either from seeds or using vegetative 
propagation techniques. 

Plants thus obtained may be cultivated and used for the production of various 
proteins. It is envisioned that for some appUcations the harvested material will be 
subject to purification and the heterologous protein isolated in a substantially pure 
form. In other instances the harvested plant material will be used as edible or oral- 
vaccines or ther^eutic agents. In addition, the foreign protein of interest may be 
purified from the harvested plant material and may be formulated into a form for oral 
use or an injectable dosage form. In still other examples the harvested plant material 
may be used directly in an industrial process. Thus, the isolation of harvest inducible 
DNA sequences allow for many strategies for the production of heterologous proteins. 

The above description is not intended to hmit the claimed invention in any 
manner, fiutherinore, the discussed combination of features might not be absolutely 
necessary for the inventive solution. 

The present invention will be further illustrated in the following 
examples. However it is to be understood that these examples are for illustrative 
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purposes only, and should not be used to limit the scope of the present invention in 
any manner. 

Examples 

Example 1: Isolation of harvest-inducible (HI) cDNA clones 

HI cDNAs were isolated from a cDNA subtractive library, made from mKNA 
obtained from field harvested alfalfa, as shown in Figure 1, using a PCR-Select™ kit 
from ClonTech (Protocol #Ptl 117-1, www.clontech.com). Briefly, this technique 
compares two populations of mKNA and obtains clones of genes that are expressed in 
one population but not in the other. 

First, two mRNA populations were converted into cDNA: the cDNA that 
contained the harvest-specific transcripts, referred to as the 'tester'* cDNA and the 
reference cDNA from the non-harvested plants, referred to as "driver** cDNA. The 
tester and driver cDNAs were digested with Rsa I (a four-base-cutting restriction 
enzyme that yields blunt ends). The tester cDNA was subdivided into two portions, 
and each I i gated with a different cDNA adaptor. The ends of the adaptor do not have a 
phosphate group, so only one strand of each adaptor attaches to the ends of the 
cDNA. The two adaptors have stretches of id^tical sequence to allow aimealing of 
the PCR primer once the recessed ends have been filled in (See Figure. 2). 

Two hybridizations were then performed. In the firsts an excess of driver was 
added to each sample of tester. The samples were then heat denatured and allowed to 
anneal, generating the type a, b, c, and d molecules in each sample (see Figure 1). 
The concentration of high- and low-abundance sequences is thought to be equalized 
among the type a molecules because rednnealing is faster for the more abundant 
molecules due to the second-order kinetics of hybridization. At the same time, the 
single stranded (ss) type a molecules are significantly enriched for differentially 
expressed sequences, as cDNAs that are not differentially expressed form type c 
molecules with the driver. 
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During the second hybridization, the two primary hybridization samples were 
mixed together without denaturing. As a resiilt, only the remaining equalized and 
subtracted ss tester cDNAs could reassociate and fomi new type e hybrids. These new 
hybrids are double stranded (ds) tester molecules with different ends, which 
correspond to the sequences of adaptors 1 and 2R (Figure 2). Fresh denatured driver 
cDNA was added, without denaturing the subtraction mix, to further enrich fraction e 
for differentially expressed sequences. After filling in the ends by DNA polymeric, 
the type e molecules — ^the differentially expressed (harvest-inducible) tester 
sequences — ^have different annealing sites for the nested primers on their .5' and 3* 
ends. 

The entire population of molecules was then subjected to PGR to amplify the 
harvest-inducible sequences. During PGR, type a and d molecules are missing primer 
annealing sites, and thus cannot be amplified. Due to the suppression PGR effect, 
most type b molecules form a pan-like structure that prevents their exponential 
amplification (see Suppression-PGR effect, below). Type c molecules have only one 
primer annealing site and can only be amplified linearly. Only type e molecules, 
which have two different adaptors, can be amplified exponentially. These are .the 
equalized, differentially expressed sequences specific to harvested tissue. 

Next, a secondary PGR amplification was performed using nesfted primers to 
further reduce any background PGR products and to enrich for harvest-specifc 
sequences. The cDNAs were then directly inserted into Topo™, a T/A cloning vector 
&om Invitrogen. 

Su ppression-PGR 

The PGR-Select cDNA adaptors are engineered to prevent undesirable 
amplification duriiig PGR using suppression PGR (U,S. Patent #5,565,340). 
Suppression occurs when complementary sequences are present on each end of a ss 
cDNA. During each primer annealing step, the hybridization kinetics strongly favor 
(over annealing of the shorter primers) the formation of a pan-like secondary structure 
that prevents primer annealing. When occasionally a primer anneals and is extended, 
the newly synthesized strand will also have the inverted terminal repeats and form 
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another pan-like structure. Thus, during PGR, nonspecific amplification is efiBciently 
suppressed, and specific amplification of cDNA molecules with diffCTent adaptors at 
both ends can proceed normally. The 5* ends of Adaptors 1 and 2R have an identical 
stretch of 22 nucleotides (Figure 2). Primary PGR therefore requires only one primer 
for amplification, eliminating the problem of primer dimerization. Furthemiore, the 
identical sequences on the 3' and 5* ends of the differentially expressed molecules 
introduce a slight suppression PGR effect. Since these identical sequences are the 
same length as PGR Primer 1, the suppression effect becomes significant only for 
very short cDNAs (under 200 nt), because the formation of pan structures for shorter 
molecules is more efficient. Thus, longer molecules are preferentially enriched. This 
enrichment for longer molecules balances the inherent tendency of the subtraction 
procedure to favor short cDNA firagments, which are more efiBciently hybridized, 
amplified, and cloned than longer fragments. 

Plant material 

The field of alfalfa (c.v. Gala, Northrup King), located on the south edge of 
Guelph, was in its second year after planting, and had already undergone its first 
harvest of the season. Plants at the bud stage and ready for the second harvest were 
cut approximately 8 cm from the base from a 1.0 m^ area The tenq)erature in the field 
was approximately 25-28^G, and the harvesting was performed at noontime to avoid 
humidity. The control, non-harvest-treatment plant tissue was immediately frozen in 
liquid nitrogen. The harvest-treatmOTit sample was laid on the ground in a swath to 
wilt, as is done during conventional harvesting of this crop. After one hour, the 
harvested plant tissue was brought back to the lab, wrapped in tinfoil and left at 
ambient temperature (20"* C) on the brach. 

The leaves were collected for the analysis at different harvest times - 30 
minutes, 45 minutes, 2 hours, 6 hours, 24 hours. Total RNA was isolated from both 
non-harvested and harvested plant materials. cDNA was generated from both tissues 
with HI samples designated as the tester population and the non-harvested samples 
were designated as the driver population. Harvest-inducible cDNAs were inserted into 
TOP02.1 vector (Invitrogen). 
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Twelve cDNA clones^ ranging in size from 180 to 500bp, were obtained using 
the subtractive protocol outlined above. These clones were sequenced to detranine 
redundancy and to select candidates for further analysis. Seven clones of the 12 were 
independent and 4 (HI, H7, HI 1, and H12) were selected for Northern analysis. Of 
these, H7 (SEQ ID NO:l), Hll (SEQ ID NO:2) and H12 (SEQ ID NO:3) showed 
substantial and lengthy (>24 h) up-regulation following harvest and no transcription 
in non-harvested plants (see Northern blots. Example 2) 

The DNA sequence of selected clones was determined and GeneBank searches 
performed using BLAST searching algorithm (default parameters). 

Isolation of Complete cDNA Clones 

To isolate the regulatory regions of H7 and HI 1 the complete coding region of 
these genes was identified. This was done by extending our candidate cDNA clones 
in both the 5' and 3* directions using the RACE (rapid amplification of cDNA ends) 
method. 

Specifically, a cDNA population was generated from alfalfa leaves (c.v. Gala) 
grown in a greenhouse 12 hoius after harvesting using the SMART ™ RACE cDNA 
Amplification Kit from ClonTech according to manufacturer's instructions. 
Harvested plants were wilted for one hour in the greenhouse followed by wrapping in 
tin foil and incubation on the lab bench at 20^C. By this method of repeated "cDNA 
walking" and isolation of many cDNA clones overlapping each other and the original 
cDNA clone isolated by subtraction, the fiill-length transcripts were accurately 
determined for H7 (SEQ ID NO:l) and Hll (SEQ ID NO:2). The regulatory regions 
flanking the coding region were then isolated by genomic walking (see below). 

As H12 (SEQ ID NO:3) was virtually identical to an alfalfa cDNA akeady 
characterized and resident in GenBank, we did not extend HI 2 by RACE, but rather 
performed genomic walking upstream of the 5' end of the cDNA based on the 
sequence data available. The sequence of H12 shows homology to a cDNA fix>m 
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alfalfa that presumably encodes the enzyme CcoMT, thought to be involved in lignin 
fomiation (see Figure 7 for sequence comparison). 

Example 2: Analysis of expression patterns of harvest inducible (HI) 

cDNAs 

Northem blots were done using standard protocols (Sambrook et al, 1989, in 
'•Molecular cloning : a laboratory manual", 2"*^ edition. Cold Spring Harbor, N.Y. : 
Cold Spring Harbor Laboratory). Equivalent amoimts of total KNA from harvest- 
induced, heat-shock treated and wounded leaf tissue were used for hybridization. The 
hybridizations were overnight (12-24 hrs) at 42*'C with ^^P-labelled HI cDNA 
presence of 50% formamide, followed by washing using standard protocols 
(Sambrook et al, 1989). A wounding treatment was applied to alfalfa plants by lightly 
scoring a leaf with a surgical blade on the leaf surface. The wounded leaves were 
removed from the plants for analyses 30 minutes, 6 hom and 24 hours post treatment. 
Heat-shock treatments were performed by placing potted alfalfa plants into an oven 
for 15 minutes or 30 minutes at 38^C. The tissue samples were collected from the 
plants immediately following heat treatment. mRNA accimiulation for cDNA clones 
H7, HI 1 and H12 were examined imder harvest conditions. The Northem analysis 
results showed significant mRNA accumulation following harvesting but not 
wounding (Table 1, Figures 3-6). 

Table 1. Relative accumulation of HI cDNAs following harvesting, woimding and 
heat shock treatments compared with untreated tissue. 



Relative transcript level under different treatments 

cDNA clones 



harvest heat shock wounding 

m +* ? ? 

H7 ++ - - 

Hll -H- +++ ? 

H12 -H- - ? 
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* and "++" are results compared with control sample where control is consider as 

Northern analysis of H7 (SEQ ID N0:1) expression before or after harvest is 
shown m Figure 3. H7 expression increases following harvest, however, no 
expression is observed pre-harvest, or following wounding or heat shock treatments 
(data not shown). 

Expression of Hll (SEQ ID NO:2) following harvesting or heat shock is 
shown in Figure 4. Increased expression is observed following harvesting or heat 
shock treatment. Similarly, increased expression of Hll is observed following 
wounding (Figure 5). 

H12 (SEQ ID NO:3) expression is shown in Figure 6, where an increase in 
expression is observed following harvest of plant material. No expression is observed 
in pr(B-harvested tissue. A low level of expression is detected in response to a heat 
shock treatment. 

Example 3: Isolation of genomic sequences and promoter regions of ED genes 

Alfalfa leaf tissue was collected from plants grown in the greenhouse. 
Genomic DNA was isolated using a method modified fi-om Davies (Davies LG, 
Dibner MD, Batty JF: Basic methods in molecular biology. Elsevier, NY 1986, which 
is incorporated herein by reference). Constraction of the genomic walking "library" 
was performed according to the manufacturer's manual (GenomeWalker™ Kits 
CLONTECH, USA PT116-1). DNA from colonies was sequenced to find those 
containing inserts overlapping the cDNA-labelled cDNA clones H7, Hll and H12 
were used for screening of the library. 

As a result of this screening, corresponding genomic DNAs, of H7 (SEQ ID 
NO:7, Figure 8), Hll (SEQ ID NO:8, Figure 9), and H12 (SEQ ID NO:9, Figure 10) 
were obtained. Further analysis of these genomic DNA*s was carried out to identify 
the associated regulatory regions of H7 (SEQ ID NO:4), Hll (SEQ ID NO:5) and 
H12(SEQIDNO:6). 
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The regulatory regions of fhese genes may be used to drive the expression of a 
coding sequence of interest, for example, but not limited to the coding sequence of 
interest as shown in Figure 7. 

Example 4: Transgenic plants expressing harvest-inducible promoters 

Vector Construction 

In order to test expression of transgenes controlled by the harvest inducible 
(HI) promoters isolated from alfalfa, the HI promoters were fused to the beta- 
glucuronidase (GUS) reporter gene and histochemical assays conducted for GUS gene 
activity, which results in a blue colour in plant tissue (Jefferson et aL, 1987, EMBO J. 
6:3901-7). The putative HI promoter sequences were sub-cloned from Topo 
(InVitrogen) or pBluescript vectors using conventional molecular techniques and 
existing restriction sites or sites created by polymerase chain reaction (PGR). 

The putative promoter region for the H7 cDNA clone was fused to the 5* 
terminus of the GUS gene in the vector pBIlOl (Jefferson et aL, 1987, EMBO J. 
6:3901-7, Fig. la), using HindJJl andXbal; in addition, the H7 promoter was fused to 
the GUS gene in pCAMBIA3301 (GAMBIA), using Kpnl and Xbal (Fig. 1 IB). The 
Hll promoter was fused to pCAMBIA2301, and the H12 promoter was fused to 
pCAMBIA1303 (Fig- UC, D). In all cases, the promoter is also 5' to the GUS gene. 

All of the above vectors are of the binary type, which means they can be 
grown in both E. coli and Agrobacteriuniy the latter for transfer of the regions between 
the left and rigjit borders to the plant genome. 

Transfer o f HI-GUS constmcts to plants 

The binary vectors were transferred to Agrobacterium tumefaciens strain C58 
(Rif res) containing the helper plasmid pMP90. The procedure for cocultivation of 
sterile leaves from 4-week old tobacco plants (cultivar PetH4) and regeneration 
followed the method of F;isher and Guiltinah (Fisher & Guiltman, 1995, Plant Mol 
Biol Rep. 13:278-89). Selection of transgenic tissue and shoots was facilitated by 
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incorporation of kanamycin (300 mg/1) or hygromycin (25 mg/1), dq)ending on the 
vector used (see Fig. 11). 

The binary vectors were also used to transfer HI-GUS constructs to Medicago 
truncatiila according to the seedling infiltration method of Trieu et al. (Trieu AT, et 
al., 2000, Plant J. 22:531-41). 

Histochemical GUS assays for transgene expression 

Tobacco leaves from regenerated plants grown in the greenhouse, and leaves 
and stems from M. truncatiila plants grown from the cocultivated seedhngs were 
incubated in the X-gluc substrate and the green pigments removed for visudization of 
the blue precipitate resulting from GUS enzyme activity (Jefferson et al., 1987, 
EMBO J. 6:3901-7). 

Results 

Analysis of tobacco RO (primary) regenerants, 5-10 plants for each of the 
above constructs, showed GUS gene expression (i.e. blue colouration) 24 hrs 
following harvesting whereas none was evident in plants at time zero or in the non- 
transgenic controls (Fig. 12). Random sampling of portions (leaves and stem/petiole 
sections) of the M trunculata plants that had undergone cocultivation at the seedling 
stage also revealed distinct blue colouration in some sectors, but not in all parts and 
only after the harvesting treatment (Fig. 13). The sectoral pattem of the blue stain 
reflects the chimeric nature of gene transfer, and the cocultivation of intact seedlings. 
Once again, no blue colour was evident in the transgenic plants at time zero or in the 
non-transgenic controls. It is also significant that the extent of blue colouration was 
greater in the case of constracts containing the HI 1 and H7 promote than in plants 
containing the conventional 35S promoter. The lack of blue colour in transgenic 
trunculata plants at time zero demonstrates that the blue colour was not due to 
endogenous GUS activity in residual Agrobacteria. 

The extent and intensity of blue colouration in the HI-GUS plants of the 
present invention noticeably exceeds that of plants containing the GUS gene 
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controlled by the 35S promoter. The latter promoter is derived from the cauliflower 
mosaic virus and is considered to be a constitutive promoter, which provides a high 
level of expression to transgenes, especially in tobacco. Therefore, not only do the HI 
promoters of the present invention avoid the problems associated with constitutive 
expression, but they also exceed the levels of expression provided by one of the 
strongest constitutive promoter available for plants. 

As presently shown, the expression of the HI promoters is tightly regulated in 
that repeatedly no expression has been observed in the transgenic plants of the present 
invention at time zero, and does not appear until several hours after harvesting. It is 
also significant that no additional wounding of the plant tissue is needed to obtain 
high expression levels throughout all harvested tissue, although additional wounding 
or other treatments such as heat may augment expressions levels even further. 

Furthermore, the HI-GUS transgenes show the same harvest-specific induction 
pattems in tobacco and M truncatula as do the native cDNA clones in alfalfa from 
which they were isolated under harvesting conditions. Although M trunculata is a 
close relative of alfalfa, tobacco is quite distant phylogenetically from alfalfa. This 
shows that the HI promoters of the present invention are regulated in a similar pattem 
in other plant fanulies and genera, such as the grass species and have appUcations in 
crops of such species. 

The following table CTable 2) is a summary of the Sequence ID numbers 
defined in the present application. 

Table 2. Sequence ID numbers defined in the present invention. 



Sequence ID No. Description Figure 

SEQ ID NO: 1 Nucleotide sequence of H7 coding region 8 

SEQ ID NO:2 Nucleotide sequence of HI 1 coding region 9 

SEQIDNO:3 Nucleotide sequence of HI 2 coding region 10 

SEQ ID NO:4 Nucleotide sequence of H7 regulatory region 8 

SEQ ID NO:5 Nucleotide sequence of HI 1 regulatory region 9 

SEQ ID NO:6 Nucleotide sequence of H12 regulatory region 10 

SEQIDNO:? Nucleotide sequence of genomic H7 8 

SEQ lb NO: 8 Nucleotide sequence of genomic HI 1 9 

SEQ ID NO:9 Nucleotide sequence of genomic HI 2 10 

SEQ ID NO: 10 Amino acid sequence encoded by H7 coding region 8 
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SEOIDNO'll Amino acid sequimce encoded by H12 coding 

^ * region 

Nucleotide sequence of PCR-Select cDNA 



SEQ ID NO:12 ^ Zr^u"^' ^rimTr 

^ synthesis pnmer 

SEQ ID NO: 1 3 Nucleotide sequence of Adaptor 1 2 

SEQIDNO:14 Nucleotide sequence of Adaptor 2R 2 

SEQ ID NO:15 Nucleotide sequence of PGR primer 1 2 

SEQ ID NO: 1 6 Nucleotdie sequence of nested PGR primer 1 2 

SEQ ID NO: 1 8 Nucleotide sequence of complement (partial) 2 

SEQ ID NO: 19 Nucleotide sequence of complement (partial) 2 

>SEQIDN0:17 Nucleotdie sequence of nested PGR primer 2R 2 

All citations are herein incorporated by reference. 

The present invention has been described with regard to prefCTred 
embodiments. However, it will be obvious to persons skilled in the art that a number 
of variations and modifications can be made without departing from the scope of the 
invention as described herein. 



